An Extensible Framework for Query Optimization on TripleT-based RDF Stores

نویسندگان

  • Bart G. J. Wolff
  • George H. L. Fletcher
  • James J. Lu
چکیده

The RDF data model is a key technology in the Linked Data vision. Given its graph structure, even relatively simple RDF queries often involve a large number of joins. Join evaluation poses a significant performance challenge on all state-of-the-art RDF engines. TripleT is a novel RDF index data structure, demonstrated to be competitive with the current state-of-the-art for join processing. Query optimization on TripleT, however, has not been systematically studied up to this point. In this paper we investigate how the use of (i) heuristics and (ii) data statistics can contribute towards a more intelligent way of generating query plans over TripleT-based RDF stores. We propose a generic framework for query optimization, and show through an extensive empirical study that our framework consistently produces efficient query evaluation plans.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sesame: An Architecture for Storin gand Querying RDF Data and Schema Information

RDF and RDF Schema provide the first W3C standard to enrich the Web with machine-processable semantic data. However, to be able to use this semantic data, a scalable, persistent RDF store and a powerful query engine using an expressive query language are needed. Sesame is an extensible architecture implementing both of these. Sesame can be based on arbitrary repositories, ranging from tradition...

متن کامل

Cost Analysis of Joins in RDF Query Processing Using the TripleT Index

Cost Analysis of Joins in RDF Query Processing Using the TripleT Index By Kanwei Li The Semantic Web movement has led to a growing popularity of RDF and its query languages. Clearly, good query performance is important in allowing information to be quickly retrieved from RDF datasets that are everincreasing in size. We use the TripleT indexing scheme for RDF data as a framework to examine the c...

متن کامل

A Framework for SPARQL Query Processing, Optimization and Execution with Illustrations

The vision of Semantic web is to allow intelligent description and interchange of integrated data from various distributed web resources. A structure for this metadata on web is known as Resource Description Framework (RDF) where data is in the form of XML (Extended Markup Language). A query language is used to retrieve such large RDF data effectively and efficiently which is known as SPARQL (S...

متن کامل

Efficient Query Processing in Distributed RDF Databases

The Semantic Web takes the concept of data integration in a mediator architecture to a new level. Since the distribution of data is not managed centrally, query processing over a set of autonomous data stores poses new challenges for query optimization. Because URIs provide a global resource identification mechanism and RDF data stores have no schema, all data stores can potentially have inform...

متن کامل

Towards Efficient SPARQL Query Processing on RDF Data

Efficient support for querying large-scale RDF triples plays an important role in Semantic Web data management. This paper proposes an efficient RDF query engine to evaluate SPARQL queries, where the inverted index structure is employed for indexing RDF triples. We first design and implement a set of operators on the inverted index for query optimization and evaluation. Then we propose a main-t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015